Fast Homology Search using Categorisation Profiles
نویسندگان
چکیده
Homology search is an important step in discovering evolutionary relationships in modern molecular biology. In particular, it is key to analysing data from large-scale sequencing initiatives: by establishing homology between a newly-discovered sequence and well-understood sequences in curated repositories, it is possible to infer structure and function without the need for costly, time-consuming wet laboratory work. Biologists almost always search protein collections in preference to nucleotide collections because, in general, proteins are better annotated and, therefore, more useful in discovering evolutionary relationships. It is possible to deduce the function of a protein based on its similarity to sequences that have been previously characterised by comparing the primary or linear structure of a protein. Indeed, more than half of the data derived from newly-sequenced genomes can be characterised based on its similarity to other well-understood organisms [5]. As new genomes are sequenced and annotated, function elucidation for this new data becomes increasingly possible.
منابع مشابه
Invariant Categorisation of Polygonal Objects using Multi-resolution Signatures
With the increasing use of 3D objects and models, mining of 3D databases is becoming an important issue. However, 3D object recognition is very time consuming because of variations due to position, rotation, size and mesh resolution. A fast categorisation can be used to discard non-similar objects, such that only few objects need to be compared in full detail. We present a simple method for cha...
متن کاملRapid similarity search of proteins using alignments of domain arrangements
MOTIVATION Homology search methods are dominated by the central paradigm that sequence similarity is a proxy for common ancestry and, by extension, functional similarity. For determining sequence similarity in proteins, most widely used methods use models of sequence evolution and compare amino-acid strings in search for conserved linear stretches. Probabilistic models or sequence profiles capt...
متن کاملORFeus: detection of distant homology using sequence profiles and predicted secondary structure
ORFeus is a fully automated, sensitive protein sequence similarity search server available to the academic community via the Structure Prediction Meta Server (http://BioInfo.PL/Meta/). The goal of the development of ORFeus was to increase the sensitivity of the detection of distantly related protein families. Predicted secondary structure information was added to the information about sequence ...
متن کاملThe Role of Automated Categorisation in e-Government Information Retrieval
High-precision search results are essential for supporting e-government employees’ information tasks. Prior studies have shown that existing features of e-government retrieval systems need improvement in terms of search facilities, navigation and metadata. This paper investigates how automated categorisation can enhance information organisation and retrieval and presents the results of a realis...
متن کاملCPHmodels-3.0—remote homology modeling using structure-guided sequence profiles
CPHmodels-3.0 is a web server predicting protein 3D structure by use of single template homology modeling. The server employs a hybrid of the scoring functions of CPHmodels-2.0 and a novel remote homology-modeling algorithm. A query sequence is first attempted modeled using the fast CPHmodels-2.0 profile-profile scoring function suitable for close homology modeling. The new computational costly...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005